Video Summarization Based on Feature Fusion and Data Augmentation
نویسندگان
چکیده
During the last few years, several technological advances have led to an increase in creation and consumption of audiovisual multimedia content. Users are overexposed videos via social media or video sharing websites mobile phone applications. For efficient browsing, searching, navigation across collections repositories, e.g., for finding that relevant a particular topic interest, this ever-increasing content should be efficiently described by informative yet concise representations. A common solution problem is construction brief summary video, which could presented user, instead full so she/he then decide whether watch ignore whole video. Such summaries ideally more expressive than other alternatives, such as textual descriptions keywords. In work, summarization approached supervised classification task, relies on feature fusion audio visual data. Specifically, goal work generate dynamic summaries, i.e., compositions parts original include its most essential segments, while preserving temporal sequence. This annotated datasets per-frame basis, wherein being “informative” “noninformative”, with latter excluded from produced summary. The novelties proposed approach are, (a) prior classification, transfer learning strategy use deep features pretrained models employed. These been used input classifiers, making them intuitive robust objectiveness, (b) training dataset was augmented using publicly available datasets. evaluated three user-generated videos, it demonstrated data augmentation able improve accuracy based human annotations. Moreover, domain independent, any extended rely richer representations modalities.
منابع مشابه
Video Content Summarization and Augmentation Based on Structural Semantic Processing and Social Network Analysis
Video summarization techniques have been proposed for years to offer people comprehensive understanding of a whole story on video. However, although these traditional methods give brief summaries for users, they still do not provide conceptorganized or structural views. Besides, the knowledge they offer to users is often limited to existing videos. In this study, we present a structural video c...
متن کاملVIDEO CLASSIFICATION BASED ON LOW−LEVEL FEATURE FUSION MODEL (WedPmPO2)
This article presents a new system for automatically extracting high−level video concepts. The novelty of the approach lies in the feature fusion method. The system architecture is divided into three steps. The first step consists in creating sensors from a low−level (color or texture) descriptor, and a Support Vector Machine (SVM) learning to recognize a given concept (for example, "beach" or ...
متن کاملFeature based Information Extraction for Generic Video Summarization
Video summarization plays a very significant role in navigating a video, to understand its information or to search the required event information. Our proposed research work minimizes the time required for processing each of the video frames firstly, by reducing their effective size, and then it is followed by an efficient technique for generating the summarized video. The information containe...
متن کاملCountermeasures for Automatic Speaker Verification Replay Spoofing Attack : On Data Augmentation, Feature Representation, Classification and Fusion
The ongoing ASVspoof 2017 challenge aims to detect replay attacks for text dependent speaker verification. In this paper, we propose multiple replay spoofing countermeasure systems, with some of them boosting the CQCC-GMM baseline system after score level fusion. We investigate different steps in the system building pipeline, including data augmentation, feature representation, classification a...
متن کاملLawn Tennis Video Summarization based on Audiovisual and Text Feature Analysis
In this paper, a new video summarization approach for lawn tennis video is presented. The proposed method uses frame color histogram to classify video into play field color shots (PFCS) and non play field color shots (NPFCS). Play field color shots are the segments of interest and used to recognize the tournament class. A dominant colored frame from every PFCS is extracted as a salient frame. O...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Computers
سال: 2023
ISSN: ['2073-431X']
DOI: https://doi.org/10.3390/computers12090186